منابع مشابه
MPEG-7 sound-recognition tools
The MPEG-7 sound-recognition Descriptors and Description Schemes consist of tools for indexing audio media using probabilistic sound models. The Descriptors provide containers for category labels, as well as data structures for quantitative information about sound content. We describe the normative tools, as well as informative methods, for automatic description extraction and sound matching.
متن کاملHow Efficient Is Mpeg-7 for General Sound Recognition?
Our challenge is to analyze/classify video sound track content for indexing purposes. To this end we compare the performance of MPEG-7 Audio Spectrum Projection (ASP) features based on several basis decomposition algorithms vs. Mel-scale Frequency Cepstrum Coefficients (MFCC). For basis decomposition in the feature extraction we evaluate three approaches: Principal Component Analysis (PCA), Ind...
متن کاملMPEG-7 MDS Content Description Tools
In this paper, we present the tools specified by the MDS part of the MPEG-7 standard for describing multimedia data such as images and video. In particular, we focus on the description tools that represent the structure and semantics of multimedia data to whose development we have actively contributed. We also describe some of our research prototype systems dealing with the extraction and appli...
متن کاملSpeaker recognition using MPEG-7 descriptors
Our purpose is to evaluate the efficiency of MPEG-7 audio descriptors for speaker recognition. The upcoming MPEG-7 standard provides audio feature descriptors, which are useful for many applications. One example application is a speaker recognition system, in which reduced-dimension log-spectral features based on MPEG-7 descriptors are used to train hidden Markov models for individual speakers....
متن کاملInstrument Sound Description in the Context of MPEG-7
We review a proposal made in the framework of MPEG-7 for the description of instrument sounds based on perceptual features. Results derived from experiments dealing with the perception of musical timbre allow us to approximate the main underlying perceptual dimensions from signal descriptors. A combination of these signal descriptors allows a comparison between sounds in terms of an estimated p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Circuits and Systems for Video Technology
سال: 2001
ISSN: 1051-8215
DOI: 10.1109/76.927433